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ABSTRACT. The set of all error-correcting block codes over a fixed alphabet 
CN , with q letters determines a recursively enumerable set of rational points in the 

^ \ unit square with coordinates {R,5):= (relative transmission rate, relative minimal 

lO \ distance). Limit points of this set form a closed subset, defined hy R < CKq(5), where 

Q ■ oiq{5) is a continuous decreasing function called asymptotic hound. Its existence 

was proved by the first-named author in 1981 ([Manl]), but no approaches to the 
Q ! computation of this function are known, and in [Man5] it was even suggested that 

^ \ this function might be uncomputable in the sense of constructive analysis. 



In this note we show that the asymptotic bound becomes computable with the 
assistance of an oracle producing codes in the order of their growing Kolmogorov 



;_j , complexity. Moreover, a natural partition function involving complexity allows us 



to interpret the asymptotic bound as a curve dividing two different thermodynamic 
phases of codes. 



Introduction 

In this article, we address again two related basic problems about asymptotic 
bounds for codes, discussed recently in [Man5] and in [ManMar]. The first one 
is the problem of computability, or, more suggestively, plottability of the bound. 
The second one is the problem of interpretation of this bound as a kind of phase- 
transition curve. 
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We start the sec. 1 below with precise definitions of the relevant notions, and 
the reader may wish to turn to it immediately. Here we restrict ourselves to some 
explanations on the intuitive level. 

Consider all error-correcting block codes C in a fixed alphabet A with q let- 
ters. Each such code determines its code point {R{C),d{C)) in the plane (R:= 
transmission rate, d:= minimal relative Hamming distance). 

The asymptotic bound R = C(q{d) is a continuous curve in the plane (-R, 5) such 
that all limit points of the set of code points lie below or on this bound, whereas 
all isolated code points lie above it. 

Since 1981, when the existence of the asymptotic bound (and its versions for 
various structured codes, such as linear ones) was discovered in [Manl], many esti- 
mates for it from above and from below were established, but no exact formula was 
found. This led one of us to conjecture in [Man5] that the function R = C(q{d) might 
be uncomputahle (and its graph unplottable) in the technical sense formalized in 
[La] and [Brat We]. 

Here we treat codes assuming that an oracle is given that produces them in 
the order of increasing Kolmogorov complexity, and show that with the assistance 
of such an oracle R = aq{d) becomes "plottable" (sec. 2), and that appropriate 
partition functions involving either the Kolmogorov complexity, or Levin's prefix 
complexity ([Levi]) change behavior across this asymptotic bound (sec. 3). (At the 
end of [Man3] it was argued that civilization is such a universal oracle). 

Slightly more precisely, it is known (see [CoGo], [BaFo], [VlaNoTsfa], and sec. 
1 below) that if one chooses first a natural enumeration of codes and then gener- 
ates error-correcting codes in the order of their "size" (actually, any computable 
order rather than complexity), then the density of the respective code points is con- 
centrated below or near the bound (1.2) (for linear codes (1.3)), that lies in turn 
strictly below the asymptotic bound. Complexity is invoked here principally in 
order to identify typical random codes with codes whose complexity is comparable 
with their size: cf. [ZvoLe], [Lev2], [LiVi] 

By contrast, we show that code points of codes generated in the order of growing 
complexity, that puts a considerable amount of highly non-random codes at the 
foreground, tend to be well distributed below the asymptotic bound, with the bound 
itself appearing as a "silver lining" of the cloud of code points. 

The last sec. 4 is dedicated to a sketch of "quantization" of the classical ensemble 
of codes. 



1. Asymptotic bound as a non— statistical phenomenon 

1.1. Codes and code points. Here we recall our main definitions and results 
of previous works. 

We choose and fix an integer q > 2 and a finite set, alphabet A, or Ag, of 
cardinality q. An (unstructured) code C is defined as a non-empty subset C C A^ 
of words of length n > 1. Such C determines its code point 
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{R{C),5{C)) 



in the (i?, 5)-plane, where R{C) is called the transmission rate and 5(C) is the 
relative minimal distance of the code. They are defined by the formulas 

5(C) := ^^, rf(C) :=min{rf(a,6)|a,6GC,a^6}, ?i(C) := n, 
n{C) 

^(C) = i^, MC):=log,card(C), (1.1) 

where d{a, b) is the Hamming distance 

d{{ai), {bi)) := card{z G (1, . . . , n) | a^ 7^ bi}. 

In the degenerate case cardC = 1 we put d{C) = 0. We will call the numbers 
k — k{C), n = n{C), d — d{C), code parameters and refer to C as an [n, k, (i]g-code. 
We denote by CodeSq the set of all such codes, and by cp : CodeSq — )■ [0, 1]^ fl Q^ 
the map C 1— ?> Pq- The multiplicity of a code point x is defined as cardinality of 
the fiber cp~^{x). 

If g is a prime power, and Aq is endowed with a structure of a finite field F^, 
then a linear code is a linear subspace of A^. The set of linear codes is denoted 
CorfesJ'". 

Our starting point here will be the following characterization of the set of all code 
points, first proved in its final form in [Man5]. Note that i?-axis is traditionally 
drawn as vertical one. 

1.2. Theorem. There exists such a continuous function aq{6), d G [0,1], that 



(i) The set of code points of infinite multiplicity is exactly the set of rational 
points (-R, 5) G [0,1]^ satisfying R < aq{d). The curve R = aq{d) is called the 
asymptotic bound. 

(a) Code points x of finite multiplicity all lie above the asymptotic bound and 
are isolated: for each such point there is an open neighborhood containing x as the 
only code point. 

(Hi) The same statements are true for linear codes, with, possibly, different as- 
ymptotic bound R = a^"^ (d) . 

1.3. Good codes. One characteristic of a good code is this: it maximizes 
simultaneously the transmission rate and the minimal distance. From this perspec- 
tive, good codes are isolated ones or lying close to the asymptotic bound. Below 
we briefly describe known results showing that "most" randomly chosen codes are 
not good. To the contrary, in the next section we show that in order to recognize 
good codes one must generate codes of low Kolmogorov complexity, that is codes 
allowing short programs producing them. 

This is exactly what has happened historically, when algebraic geometric codes, 
discovered by Goppa, were used by Tsfasman, Vladut and Zink in order to amelio- 
rate the Gilbert-Varshamov bound: cf. an early survey [ManVla] and [VlaNoTsfa] . 

In this sense, moral of this note is just opposite to the title of [CoGo]: Only 
codes about which we can think can be good. 

1.4. Shannon's ensemble. We sketch here some well known arguments and 
results (see e. g. [CoGo], [BaFo], [VlaNoTsfa] and references therein) showing that 
most (unstructured) g-ary codes lie lower or only slightly above the curve 

R=l{l-Hq{S)) (1.2) 

where Hq{d) for < 5 < 1 is the g-ary entropy function 

Hq{S) = Slogqiq - 1) - 5logq5 - (1 - 5)log,(l - 5). 

Notice that inside [0, 1]^ lies only the part of this curve for which < R < 1/2. 
The Gilbert-Varshamov bound 

R = l-Hq{S) (1.3) 



plays a similar role for linear codes: cf. Remarks below. 

In order to make the statements above precise, one introduces Shannon's Random 
Code Ensemble RCE^ of g-ary codes of block length n. Each code in RCE^ is 
a set of pairwise different words in A^ chosen randomly and independently with 
uniform probability q~"'. 

1.5. Proposition ([CoGo, sec. V]). (i) For any e > 0, the probability 
(in RCEn) that Hq{d/n) > max(l — 2i?, 0) + e, where R = k/n, is bounded by 

q-en{l+o{l)) as n^ oo. 

(a) Similarly, the probability that Hq{d/n) > 1 — R + e is bounded by e~'^ 
as n -^ oo. 

Strategy of the proof. One easily sees that the number of words at a (Ham- 
ming) distance < d from any fixed word in A^ is 

Yol,{n,d) = J2(fjiQ-^y- (1-4) 

As is well known, one can estimate this quantity in terms of the g-ary entropy: 

q(^'(^)-«W)'^ < Vo/,(n,n5) < Q^'(^)^ 

Following [CoGo], sec. V, denote by X^'^' the random variable on RCE^ whose 
value at a code is the number of unordered pairs of distict code words at a distance 
< d from each other. Clearly, on codes of cardinality q^, we have from (1.4) 

E{X) = (^''\ ^'-1 (?)(Q~^)' ^ ^n[//,(d/n)-(l-2i?)]+o(n)_ 
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One can similarly calculate -E'(X^), and to use Chebyshev's inequality to prove (i). 
The last statement is obtained along the same lines. For details, see [CoGo]. 

1.6. Remarks. For unstructured code points {R,S) with 1 — Hq{5) < 2R the 
same reasoning shows so that the average number of pairs at distance d is large. 

In the case of linear codes, the relevant code points concentrate in an exponen- 
tially narrow neighborhood of the VG bound. 



Since linear codes have considerably smaller Kolmogorov complexity than the 
general ones, this behavior is compatible with our discussion in sec 2. below. 

1.7. Spoiling operations as computable functions on codes. The proof 
of existence of asymptotic bound (essentially, the only known one) is based upon 
the existence of three types of rather banal combinatorial operations on (general, or 
linear) codes that produce from a given code several codes with worse parameters. 
The subsequent combinatorial characterization of isolated code points as points of 
finite multiplicity, and proof in [Man5] that any point with rational coordinates 
below the asymptotic bound is a code point crucially uses these operations as well. 

In each class, the result of application of such an operation to a given code is by 
no means unique. Here, using the discussion in [ManMar], sec. 1 (that reproduces 
several earlier sources), we will define three total recursive spoiling maps 

Si : CodeSq — )■ CodeSq, i = 1,2, 3, 

that are compatible with the map C i— )> [n{C),k{C),d{C)]q and whose effect on 
code parameters is summarized below: 

5*1 : [n, /c, (i]g H-). [n + 1, /c, (i], 

5*2 : [n, /c, d]q I— 7> [n — l,kjd — 1] {if n > 1, /c > 0), 
5*3 : [n, /c, d]q I— 7- [n — 1, /c', d], where k — 1 < k' < k {if n > 1, /c > 1). 

In order to make 5"^ unambiguous, we simply choose the code with minimal number 
wrt some computable numbering from a finite set of codes that can be obtained in 
this way, and define it to be Si{C). 

Another (minor) point is to decide what to do if mild restrictions in brackets do 
not hold for C. The simplest solution that we adopt is to put then Si{C) = C. 

1.8. Block length and distance between isolated codes. We will show now 
that knowing the distance of an isolated code point x from its closest neighboring 
code point, we can estimate from above the maximal block length of a code mapping 
to X and hence also the multiplicity of x. Distance in [0, 1]^ is defined here as 
dist((a, 6), (c, d)) := max(|a — c|, |6 — d\). 



1.8.1. Proposition. Let {R,S) be an isolated code point and denote by p its 
distance from the closest neighboring code point. Assume that (i?, d) is the code 
point of some C C A^ . In this case we have 

AT ^ i ^~P ^- P 

JM < max I , 

P P 



Proof. If the code parameters of C are [A^, K, D\q, then the code parameters of 
Si{C) are [A^ + 1, K, D]q so that 

dist(Pc,Ps.(C)) = max ( J^, ^) > p. 



This shows our result, because 






2. Plotting asymptotic bound with assistance of a complexity oracle 

2.1. A general setup. Let X be an infinite constructive world, in the sense of 
[Man3]. This means that we have a set of structural numberings of X: computable 
bijections Z+ — )■ X, forming a principal homogeneous space over the group of total 
recursive permutations Z+ — )> Z+ 

Consider one such bijection u = ux '■ 2+ — )• X. It defines an order on X: x' < x 
iff i/~^(x') < u~^{x). Equivalently, we can imagine such a bijection as an order in 
which elements of X are generated: x is generated at the z/(x)-th step. 

Another important class of bijections that we have in mind consists of (un- 
computable) Kolomogorov orders defined and discussed in [Man3]. Namely, let 
u : Z"*" — )■ A be an optimal (in the sense of Kolmogorov and Schnorr) partial 
recursive enumeration. Then K^ix) := min {k \ u{k) = x} is the (exponential) 
Kolmogorov complexity, and the Kolmogorov order of A is the order of grow- 
ing complexity. If we denote the respective Kolmogorov order K.u{x), we have 
ciKu{x) < K.u{x) < C2Ku{x) for constants ci,C2 > depending only on u. Simi- 
larly, another choice oiu produces another order differing from K.u by a permutation 
of linear growth. 



Let now X,Y be two infinite constructive worlds, z/x,^y respective structural 
bijections, u : Z+ -^ X , v : Z+ — )> Y two optimal enumerations, and K^^Ky the 
respective Kolmogorov complexities. 

Consider a total recursive function / : X ^ Y. 

2.2. Proposition. For each y G f{X), there exists x E X such that 

y = f{x), Ku{x) < const ■ uy\y) (2.1) 

where the constant can he calculated in terms of u,v^i'x,t-'y- 

Proof. Informally, this means that we can effectively generate all elements of 
the (enumerable) image f{X) C y in their structural order, if an oracle generates 
for us all elements of X in the order of growing Kolmogorov complexity. 

In fact, denote byF: X— j-yxZ"*" the following map: 

F{x) := {f{x), n{x)), where n{x) := card {x' \ ^^^{x') < ^^^{x), f{x') = f{x)}. 

In plain words, n{x) is the number of x in the set f~^{f{x)) wrt the order induced 
by z/x- 

Clearly, F is a (total) recursive function. Hence the image E G Y x Z"*" of F is 
an enumerable subset of y x Z+. 

For each to G Z+, put 

Xm:^{xeX\ n{x) = m} CX, Y^ := f{X^) C Y. 

Then Xm (resp. Ym) is an enumerable subset of X (resp. Y), and the restriction 
of / upon Xm induces a bijection of these sets. Moreover, f{Xi) = f{X), so that 
we can define the partial recursive function </? : Y -^ X , with domain f{X), which 
is f~^ on its domain. 

Hence, for any x G Xi such that f{x) = y, we will have 

Kuix) = Ku{(piy)) < c^Kyiy) < c'l^Y^iy). 

for appropriate constants c^,c'. Here and below we use some basic inequalities 
involving complexities, proved e.g. in [Man4], VI. 9. 

This completes the proof. 



2.3. Finite vs infinite multiplicity. We keep notation of the previous sub- 
sections, in particular, / : X ^ Y is a total recursive function. For any y & Y, 
call 

mult (y) := card f~^{y) 

the multiplicity of y. The Proposition 2.2 shows that with assistance of a complexity 
oracle for X we can generate consecutively elements of Y of nonzero multiplicity. 
For applications to codes, we want to divide them into two basic subsets: elements 
of finite multiplicity Yfi^ (they will correspond to isolated code points) and elements 
of infinite multiplicity Y^o- The latter will correspond to code points lying below 
or on the asymptotic bound. 

From the definitions above, it is clear that 

^oo c ■ ■ • C f{X^+i) C /(X^) C ■ ■ ■ C /(Xi) = /(X), 

and 

^oo = n^=i/(X^), Yf,^ = f{X) \ Y^. 

We have the following extension of the Proposition 2.2. 

2.3.1. Proposition. For each y G Y^o and each m > 1, there exists unique 
Xm £ X such that y = f{xm), n{xm) = rn, and 

Ku{xm) < const ■ Uy^ {y)mlog{uY^ {y)m) (2.2) 

where the constant does not depend on y, m and can he calculated in terms of 

U,V,l'x,l'Y- 

Proof. Define the partial function 

$ : y X Z+ ^ X 

with domain 

-D($) := { (y, m) I mult (y) > m} 

such that 

$(y, m) := the m — th element in the fiber f~^{y). 
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One easily sees that its graph is enumerable, hence $ is partial recursive. The 
element Xm in the statement of Proposition is just $(y, m). Therefore 

Ku{xm) = Ku{^{y,m)) < const ■ K((z/^^(y), m)). 

One can get various estimates of a chosen Kolmogorov complexity K of the pair 
(z/y(y),m) by choosing various structural numberings of the product of two con- 
structive worlds Y X Z+r cf. a more detailed discussion in sec. 2.7-2.10 of [Man4]. 
Here we use the standard estimate symmetric in both arguments: 

K((z/y(y),m)) < const ■K{uY{y))Kim)logiK{uY{y))K{m)). 

This completes the proof, since complexity on Z+ is majorized by identical function. 

Now, an oracle mediated process of generating sets Yoo, Yfin, can be described 
as the following inductive procedure. Choose a sequence of pairs of positive integers 
{Nm,m), m = 1,2,3..., A^^+i > N^. 

Step 1. Produce the list of all elements y G f{X) such that z/y^(y) < A''!. Denote 
this list Ai, and put Bi = 0. 

If lists (subsets) Am.Bm C f{X) are already constructed at the m-th step, go 
to 

Step m+1. Produce the list of all elements y E f{X) such that z/y^(y) < Nm+i- 
Denote by Am+i the subset of elements y in this list for which there exists x E X 
with f{x) = y, n{x) = m+1, and denote by Bm+i the subset of remaining elements. 
According to (2.2), we will have to ask the oracle to produce the list of x E X with 
explicitly bounded complexity, in order to be sure that this x with n{x) = m + 1 
appears in this list, if it exists at all. 

It is clear that Am U Bm C Am+i U -Bm+i, and that the union of this sequence of 
sets is f{Y). Moreover, Bm C Bm+i and Um^iBm = Yfi^. The passage from Am to 
Am+i generally involves both adding new elements y (with A^^ < z/y^(y) < Nm+i) 
and forwarding some of the elements of Am to Bm+i (whenever it turns out that 
in their fiber no new element of X appears) . 

2.4. A structural numbering of q'— ary codes. We will now explain how the 
general procedure described above can be applied to codes. More precisely, we will 
describe the constructive world of Q-ary codes X — CodeSg, the constructive world 
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of rational points Y = [0, 1]^ fl Q^ and the total recursive map f : X ^ Y, C \-^ 
cp{C). 

We fix q and the alphabet A of cardinality q. We fix also a total order on 
A, say, by identifying A with {0,1,...,^ — 1}. We then order all words in A" 
lexicographically. 

Define now the following computable total order (or, equivalently, computable 
bijective enumeration u) of the set of all non-empty codes CodeSq with /c > 0: 

a) If rti < ?i2, all codes in A^^ come before those in A^^. 

b) If /ci < /c2, all [n, /ci, (i]g-codes come before [n, /ci, (i']q-codes. 

c) When (n, q'^) are fixed, order all words in A"^ lexicographically, then consider 
induced order on words in any code C C A", and finally encode C by the concate- 

k 

nation of all its elements put in the lexicographic order. Such a word w{C) E A^'^ 
determines C uniquely. Finally, order all [n, /c, (i]g-codes in the lexicographic order 

of«;(C). 

Clearly, n(C), [A;(C)] + 1 and d{C) become total recursive functions CodeSq — )■ Z_|_ 
(condition k{C) > means that C contains at least two different words the distance 
between which is therefore positive). 

Finally, the function "code point" 

op(c) :^ m, '- 

\ n n 

is a total recursive map CodeSq — )■ [0, 1]^ fl Q^. 

2.5. Plotting asymptotic bound. Now apply to the codes the general proce- 
dure discussed above. Fix an enumeration uy of rational points in the unit square 
in some natural way. To make the picture visually transparent, choose the sequence 
Nm in such a way that the set of points with Uy (y) < N^ contains the subset Cm 
of all points with denominators of each coordinate dividing ml, and plot at the step 
m only the points of A^, Bm contained in Cm- 

Clearly, "pixels" in Cm form the vertices of the square lattice of radius 1/ml. 
Call a subset of Cm saturated, if it is a union of sets of the form Sa,b- {{x,y)\x < 
tt, y < b}, (a, 6) G Cm- To motivate this definition, recall that the subset of Cm 
lying under or on the asymptotic bound is saturated. Hence it must be contained 
in the maximal saturated subset Dm of Am H Cm - 
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The upper boundary r^, of this subset (consisting of horizontal and vertical 
segments of the length 1/ml that connect neighboring points) is our m-th approx- 
imation to the asymptotic bound. 

Obviously, Bm is the subset of isolated codes constructed at the m-th step. 

The status of any point that is above F^ but not contained in Bm will become 
clear only at a subsequent step. 

3. Partition functions for codes and phase transition effects 

3.1. Partition functions involving complexity. Let X be an infinite con- 
structive world. Following L. Levin ([Levi], [Lev2]), we will consider functions 

p : X ^ R>o U {oo}, 

that are enumerable from below in the sense that the set 

{{r,p{x)) I r < p{x)} C Q X X 

is enumerable. 

Furthermore, Levin introduces the notion of a quasinorm functional on the set of 
enumerable from below functions and shows that for any choice of such a functional 
N, the set of functions p with N{p) < oo admits a maximal one in the sense 
that it majorizes any other function after multiplication by an appropriate positive 
constant. 

We quote here two representative examples showing relation of this result to 
complexity: 

3.1.1. Proposition, (i) Let 

N{p) := sup (r ■ card {x \p{x) > r}). 

For this quasinorm, functional p{x) := Ku{x)~^ is a maximal function, where Ku 
is a Kolmogorov complexity on X . 

(a) Let 

N{p) := ^p{x). 
xex 
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For this quasinorm, functional, p{x) := KPy{x)~^ is a maximal function, where 
KPy is an (exponential) prefix-free complexity on X depending on the respective 
optimal "decompressor" v. 

Initial definition of prefix-free complexity involved the choice of the world of bi- 
nary words for X. However, Levin's result 3.1.1(ii) gives a very natural independent 
characterization of this complexity in terms of enumerable from below probability 
distributions on X whose definition uses only the fact that X is a constructive 
world. 

This construction shows that it is natural to consider at least two types of par- 
tition functions on a constructive world X that endow objects of low complexity 
with higher weight: Z(X,/3) = Ex6X^-(^)"^ and ZP{X,P) = E.ex^Pvix)-^ 
where (3 is the inverse temperature. Both absolutely converge for /3 > 1 and di- 
verge for /3 < 1. The difference is that the first one diverges at /? = 1, whereas the 
second one converges for /3 = 1 as well. Divergence is easily seen, if one replaces 
Kolmogorov complexity by Kolmogorov order, in which case the partition function 
becomes simply Riemann's Cif^)- 

In the following we will use versions of Z(X, (3), in particular, because the usual 
Kolomogorov complexity was extended to the nonconstructive world of partial re- 
cursive functions (e. g. in [Sch] and in the first 1977 edition of [Man4]), and this 
allowed one to prove for it a number of useful estimates. Here is the simplest one. 

3.2. Lemma. Let Y be an infinite decidable subset of a constructive world X. 
Endowing Y with the induced structure of constructive world, choose two exponen- 
tial Kolmogorov complexities Ku{X, *), resp. K^iY, *) of the objects in X , resp. in 
Y . Then the restriction of Ku{X, *) to Y is equivalent to K^iY^ *) in the sense that 
one of these functions multiplied by a positive constant majorizes another one. 

Proof. The embedding i : Y ^ X is total recursive function. Define the 
function j : X ^ Y a,s identity on Y , and taking a constant value yo (z Y on the 
complement X \Y . Since this complement is enumerable, j is total recursive as 
weU. Hence Ku{i{y)) < ciKy{y), Ky{j{x)) < C2Ku{x). 

3.3. Phase transition. Since the function aq{5) is continuous and strictly 
decreasing for 5 G [1,1 — q~^), the limit points domain R < aq{5) can be equally 
well described by the inequality 5 < f3q{R) where (3q is the function inverse to aq. 
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Fix now an i? G Q n (0, 1). For A G Q n (0, 1), put 

Z(i?,A;/3):= Yl K^{C)-^+'^^^-\ (3.1) 

C: R(C)=R,A<S{C)<1 

where Ku is an exponential Kolmogorov complexity on CodeSq. 

3.3.1. Theorem, (i) // A > l3q{R), then Z{R,A;l3) is a real analytic function 
of (3. 

(a) If A < l3q{R), then Z{R,A;(3) is a real analytic function of 13 for 13 > l3q{R) 
such that its limit for (3 — f3q{R) — )■ +0 does not exist. 

Proof. If A > l3q{R), then all codes in the summation domain of (3.1) are 
isolated ones, and there is only a finite number of them, hence Z{R,A;I3) is real 
analytic. Otherwise, this set of codes is infinite decidable subset of CodeSq, and 
one can appeal to 3.2. 

3.3.2. Comments. To embed the statement of Theorem 3.3.1 in a conventional 
environment of thermodynamics, one should have in mind the following analogies. 
The argument (3 in (3.1) is the inverse temperature, the transmission rate R is a, 
version of density p, so that our asymptotic bound transported into (T = /3~^, R)- 
plane as T = l3q{R)~^ becomes the phase transition boundary in the (temperature, 
density)-plane. 

3.4. Measures and the asymptotic bound. We now show that the plotting 
procedure described in Section 2 can be reformulated in terms of measures on the 
space of codes defined by the partition functions described above. 

The partition function Z{Xj(3) = XIxex -^"(^)~'^ determines a one-parameter 
family of probability measures on the space X of codes, for (3 > 1, given by 



Similarly, one obtains probability measures associated to the partition functions 
ZP{X,(3) and Z{R,A;l3), with the latter defined on the space of codes with pa- 
rameter R{C) = Randl- A< 5{C) < 1. 

Now consider again the oracle mediated process described in Section 2, gener- 
ating the sets Yoo = Vq nUq and Y/in = Vq \ {Vq fl Uq) of code points below and 
above the asymptotic bound, and the inductively constructed sets A^ and i?j„. 
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3.5. Proposition. The algorithm of Section 2 determines a sequence of prob- 
ability measures associated to the sets Am and Bm that converge to probability 
measures on the space of codes with parameters in Yfin and Y^o and a sequence of 
measures Pm,/? converging to a measure supported on the asymptotic bound curve. 

Proof. We work with the partition function Z{X, (3). The argument for ZP{X, j3) 
is analogous. On each of the sets S^ constructed by the oracle mediated al- 
gorithm of Section 2, one obtains an induced probability measure Pb^^/3(C) = 

K„(C)-'3z(cp-i(S^),/3)-\ where Z(cp-i(S^),/3) = Ececp-i(B™) ^-(C')'"^- Since 
all the code points in Yfi^ have finite multiplicity, and each B^ contains finitely 
many code points, the Zb^{P) are finite sums for all m > 1. Since the sets 
Bm C Bm+i are nested, in the limit to — )■ oo, the probability measures Pb^,/3(C) 
converge to the probability measure supported on cp~^{Yfin) given by Pfin,i3{C) ~ 
K^{C)-PZ{cp-HYf,rr),P)-' with Z{cp-\Yf,rr),^) = Ececp~HY,.^)Ku{C)-^. 

In the case of the sets A^, one has A^ = {Am H Am+i) U {Am H Bm+i), and one 
obtains the set Y^o = VgHUq of code points below the asymptotic bound as 

Y^= \J{[]Am+n). 
m>l n>0 



M,N 



Consider the sequence of sets Em,n = ^m=i ^n=o ^m+n- Then Em+i,n 3 E 
and Em,n+i C Em,n- Correspondingly, one has sequences of probability measures 

Ku{C)-^ 



E. 



{C) 



Z{cp-^{Em,n).P) 



with Z{cp ^{Em,n),(^) = Y.cecp-^{EM,N)^'^^^^ ^' ^^^^ converge, as M, A^ ^ oo 
to the probability measure Poo, f3{C) = Ku{C)~^ Z{cp~^{Yoo), (!^)~^j supported on 
codes in cp-^Yoo) with Z {cp-^' {Y^^) , /?) = Ececp-Hv^) Ku{C)-^. 

Consider then the sets Cm and Dm C Am H Cm constructed as in Section 2, 
and the upper boundary Vm of the set Dm approximating the asymptotic bound. 
Denote by Fm C Dm the region bounded between Vm and Dm H r^_i. Then the 
probability measures 

P. (C) - ^-^^^"' 

^-^"^^ z{cp-^{Fm),py 

with Z{cp~^{Fm), (3) = '^cecp-'^(Frn) ^'^(^^~^ converge to a probability measure 
Pr(C') = Ku{C)~^Z{Tj /3)~^, supported on the set of codes whose code points fall 
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on the asymptotic bound curve F = {{R,d) \R = aq{S)} itself, with Z(r,/3) = 

When using the partition function Z{R,/S.]I3) of (3.1), one has an analogous 
statement, with code points restricted to the domain R{C) = R and 1 — A < d{C) < 
1, except for the last statement about a measure supported at the asymptotic 
bound, because of the presence of a phase transition for Z{R, A; (3) precisely along 
that curve. 

4. From classical to quantum systems 

4.1. Computable functions as classical observables. In the subsections 
3.4-3.5 we have described a statistical mechanical system on the space of codes, 
where observables are computable functions on the space X of g-ary codes codes and 
the expectation values of observables are obtained by integrating these functions 
against the probability measure defined by the complexities, 

(/)/5 - / fiC) dFf,{C) = ^^ J2 /(^) Ku{C)-^ 

or similarly with the measures defined by ZP{X, (3) or Z{R, A, (3). 

In this section we describe a quantized version of this statistical system and 
explain the role of the asymptotic bound curve R = ag(/3) in this setting. 

The quantization of the system is achieved by considering code words as the 
possible independent degrees of freedom in an unstructured code, and quantizing 
them as independent harmonic oscillators, with energy levels that depend on the 
rate and the complexity of the code. 

We then show that, while code points that lie below the asymptotic bound give 
rise in this way to a bosonic field theory with infinitely many degrees of freedom, 
the code points above the asymptotic bound produce quantum mechanical systems 
with finitely many degrees of freedom. 

The partition function of the resulting quantum statistical mechanical system is 
different from the one we computed in Section 3 for the classical system, but it is 
easily derived from it and displays similar phase transition phenomena. 

We also show that the recursive algorithm of Section 2 provides a good approx- 
imation by systems with finitely many degrees of freedom for the quantum system 
associated to the set Yoo of code parameters. 
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4.2. Quantum statistical mechanical system of a single code. To make 
a single unstructured code C into a quantum system, we regard the code words 
as the possible independent degrees of freedom and we associate to each of them 
a creation and annihilation operator. This means that we consider, for each code 
word a; G C an isometry T^, with T^Tj; = 1 and such that the T^T* are mutually 
orthogonal projectors. This means that we associate to a given code the Toeplitz 
algebra Tc on its set of code words. This is the same kind of code algebra as we 
considered in our previous work [ManMar]. 

The algebra Tc is naturally represented on the corresponding Fock space, namely 
the Hilbert space Tic spanned by the orthonormal basis e^ with w = xi...xn 
ranging over the set of finite sequences (of arbitrary length A^) of the code words 
X E C. In this representation, the operator Tj^ acts by appending x as a prefix to 
a given string of code words, T^e^ = e^w 

The dynamics of this quantum system is determined by a Hamiltonian operator 
H on the Fock space, via the time evolution 

We can then assign energy levels that depend on the complexity of the code in the 
following way. 

4.3. Lemma. The time evolution a : R — ?> Aut{Tc) given by at{Tj:) = 
Ku{Cy^ Tj: is generated by the Hamiltonian He^ = £{w) log^ Ku{C) eyj, with £{w) 
the length of the word w, and has partition function 

which is a real analytic function in the domain jS > nR/ log^ Ku{C). 

Proof. The Hamiltonian implementing the time evolution at{Tj:) = Ku{Cy^ T^ 
on the Fock space Tic is the operator H on Tic satisfying 

atiA) = q''^Aq-''", ^AeTc- 

This is given by the operator Hew = ^i^logg Ku{C) e™ for all words w = xi . . .Xm 
of length i.{w) = m. We then find 



Z{C, (7, /3) = Tr{q-^^) = ^(card Wm)q~^'" '°^' ^^^^^ =H^ 



va{nR-p\og K^{C)) 



1 ' 
m 
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where the cardinahty of the set Wm of words of length m is q'^'^^ since cardC = 
qk _ qTiR^ with n[C) = n the length of the code and R{C) = R the rate. This 
series converges to (4.1) for /3 > nR/log^KuiC). 

To compare the behavior of this partition function to the Z{R, A; /3) considered in 
Theorem 3.3.1, it is convenient to change variable in (4.1) by /3 i— )■ n{C){(3—d{C)+l). 
Then, using the Singleton bound on codes, we obtain the following. 

4.4. Corollary. The function Z{C,a,a), with a = n{l3 — d + 1) is real holo- 
morphic for all (3 > with 

Z{C,a,n{P-d + l))< 



Proof. The partition function Z{C, a, a) is given by the sum 

y^^mn(i?-(/3-5+l)log^K„(C)) < \" q^^i^+S-^) K^{C)~^'^ < VK^,((7)' 
m mm 



-pm 



where the first estimate uses Ku{CY ^ < q^ ^ and the second estimate uses the 
singleton bound for codes k <n — d — 1, which gives i? + 5 — 1 < 0. 

4.5. Quantum statistical mechanical system at a fixed code point. We 

can now consider quantum statistical mechanical systems involving several codes. 
Again, the main idea is that different codes with their degrees of freedom given by 
their code words are considered as independent uncoupled systems, which means 
that the algebra of observables describing the set -^(i?,5) of g-ary codes with fixed 
code parameters (i?, 5) becomes the tensor product of the Toeplitz algebras of the 
individual codes, 

'7^i?,5) =®C6X(«.,)7^ (4.2) 

acting on the tensor product of the Fock spaces T-i[R^s) = ®ceX(H s)'^c and with the 

product time evolution a)- = ®c(^? ^ with af^(Tr) = KuiCy^T^. The partition 
function becomes, correspondingly, the product of the partition function for the 
independent systems. In particular, for a = a{n, 13, S) = n{(3 — d + 1) a variable 
inverse temperature as in [ManMar], we find that 

Z(X(^,5),a,a)= n ^(C,a,n(/3-5 + l)), 
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is a finite product for (i?, d) G Yfin and an infinite product for (i?, d) G l^oo, whose 
convergence is controlled by the convergence of the infinite product 



This in turn converges whenever the series Z(Xji^S:(^) = X^cex Ku{C) ^ con- 
verges, which is the classical partition function for a fixed code point. 

This argument shows the role of the asymptotic bound from the point of view of 
these quantized systems. In fact, recall that an infinite tensor product of Toeplitz 
algebras is a standard way to describe the second quantization of a bosonic field 
theory, see for instance [Jul] and also Section 2 of [BoCo]. However, a finite tensor 
product is a purely quantum mechanical system, with only finitely many degrees of 
freedom. Thus, the asymptotic bound separates the region Yfi^ in the space of code 
parameters where the quantum statistical system {T(^R^s),cJt) is purely quantum 
mechanical (first quantization) from the region Yoo where it is a second quantization 
of a bosonic field. 

4.6. Oracle assisted QSM system construction. It is usually interesting 
in quantum statistical mechanics to construct explicit approximations to systems 
with infinitely many degrees of freedom by systems involving finitely many ones. 
The oracle mediated construction described in Section 2 provides us with this kind 
of procedure. Consider the sets Am and Bm described in Section 2. We can then 
consider the algebras 

■Am = ®cecp-^{Am)'T'c, and Bm = ^cecp-'^{B^)'Tc, 

acting on the tensor product of the Fock spaces, and endowed with the tensor 
product time evolution as above. Moreover, by denoting, as in the previous section, 
by Fm the region between the curves Tm and Dm H r^_i, one can consider the 
QSM system associated to the codes with code points in Fm, 

These give good approximations, by systems involving only finitely many degrees 
of freedom, to the bosonic field theory associated to the set Y^ of code parameters 
and to the asymptotic bound F. 
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